An Effective Algorithm for SNP Haplotype Block Inference

نویسندگان

  • Chia-Ling Sun
  • Chang-Biau Yang
  • Yow-Ling Shiue
  • Hsing-Yen Ann
چکیده

Recently, it has been shown that there should exist a block-like structure in human genome, and thus only limited haplotype diversity can be obtained. In this paper, we propose a fixed-diversity strategy to find out the suitable block diversity and block boundaries. The diversity value in one block is defined as d 1 ∑i 1 xi 2, where xi denotes the frequency ratio of the ith kind of haplotype within the block and n denotes the number of distinct types of haplotype. We figure out that once a putative block stretches across the primary block boundaries, the diversity will increase rapidly. And the secondary block boundary effects occur when two types are merged or split into different types. The threshold in our algorithm is decided by the two detections of the primary and secondary block boundary effects. We obtain a reasonable diversity of chromosome 21 SNP data with our algorithm. Our partition result shows highly concurrence property to the haplotype data downloaded from NCBI website.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Haplotype Block Partitioning and tagSNP Selection under the Perfect Phylogeny Model

Single Nucleotide Polymorphisms (SNPs) are the most usual form of polymorphism in human genome.Analyses of genetic variations have revealed that individual genomes share common SNP-haplotypes. Theparticular pattern of these common variations forms a block-like structure on human genome. In this work,we develop a new method based on the Perfect Phylogeny Model to identify haplo...

متن کامل

HapBlock – A Suite of Dynamic Programming Algorithms for Haplotype Block Partitioning and Tag SNP Selection Based on Haplotype and Genotype Data

The suite of programs, HapBlock, is developed for haplotype block partitioning and tag SNP selection under the joint guidance of Ting Chen, Fengzhu Sun, and Michael Waterman within the Center for Computational and Experimental Genomics at the University of Southern California and with collaboration with Zhaohui Qin and Jun Liu in the department of statistics at Harvard University. This suite of...

متن کامل

Haplotype block partitioning and tag SNP selection using genotype data and their applications to association studies.

Recent studies have revealed that linkage disequilibrium (LD) patterns vary across the human genome with some regions of high LD interspersed by regions of low LD. A small fraction of SNPs (tag SNPs) is sufficient to capture most of the haplotype structure of the human genome. In this paper, we develop a method to partition haplotypes into blocks and to identify tag SNPs based on genotype data ...

متن کامل

Inference of missing SNPs and information quantity measurements for haplotype blocks

MOTIVATION Missing data in genotyping single nucleotide polymorphism (SNP) spots are common. High-throughput genotyping methods usually have a high rate of missing data. For example, the published human chromosome 21 data by Patil et al. contains about 20% missing SNPs. Inferring missing SNPs using the haplotype block structure is promising but difficult because the haplotype block boundaries a...

متن کامل

Robustness of Inference of Haplotype Block Structure

In this report, we examine the validity of the haplotype block concept by comparing block decompositions derived from public data sets by variants of several leading methods of block detection. We first develop a statistical method for assessing the concordance of two block decompositions. We then assess the robustness of inferred haplotype blocks to the specific detection method chosen, to arb...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005